Yichao Wu

Early Diagnosis of Wasted Computation in Multi-Agent LLM Systems via Failure-Aware Observability

May 31, 2026

Architecture Matters More Than Scale: A Comparative Study of Retrieval and Memory Augmentation for Financial QA Under SME Compute Constraints

Apr 20, 2026

SELF-EMO: Emotional Self-Evolution from Recognition to Consistent Expression

Apr 20, 2026

TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA

Mar 10, 2026

Tiny-Critic RAG: Empowering Agentic Fallback with Parameter-Efficient Small Language Models

Mar 01, 2026

Language-based Trial and Error Falls Behind in the Era of Experience

Jan 29, 2026

MMRPT: MultiModal Reinforcement Pre-Training via Masked Vision-Dependent Reasoning

Dec 08, 2025

Erase to Improve: Erasable Reinforcement Learning for Search-Augmented LLMs

Oct 01, 2025

StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization

May 21, 2025

Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training

May 11, 2024